Skip to content

Optimize AtomicNonlocal term for the GPU#1265

Merged
mfherbst merged 2 commits intoJuliaMolSim:masterfrom
abussy:nonlocal
Apr 19, 2026
Merged

Optimize AtomicNonlocal term for the GPU#1265
mfherbst merged 2 commits intoJuliaMolSim:masterfrom
abussy:nonlocal

Conversation

@abussy
Copy link
Copy Markdown
Collaborator

@abussy abussy commented Feb 24, 2026

This PR optimizes the AtomicNonlocal term instantiation and forces for the GPU. The same principles as #1163 and #1262 are applied. The largest performance impact is felt in stress and response calculations, when Duals are involved. However, since the AtomicNonlocal term was never fully ported to the GPU, standard SCF and forces calculations are also accelerated.

@mfherbst mfherbst enabled auto-merge (squash) April 19, 2026 20:48
@mfherbst mfherbst merged commit b034315 into JuliaMolSim:master Apr 19, 2026
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants